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ABSTRACT: The structure of the envelope protein El of two coronaviruses, mouse hepatitis virus strain A59 
and infectious bronchitis virus, was analyzed by applying several theoretical methods to their amino acid 
sequence. The results of these analyses combined with earlier data on the orientation and protease sensitivity 
of E1 assembled in microsomal membranes lead to a topological model. According to this model, the protein 
is anchored in the lipid bilayer by three successive membrane-spanning helices present in its N-terminal 
half whereas the C-terminal part is thought to be associated with the membrane surface; these interactions 
with the membrane protect almost the complete polypeptide against protease digestion. In addition, it is 
predicted that the insertion of El into the membrane occurs by the recognition of the internal transmembrane 


region(s) as a signal sequence. 


Pcie synthesized in eukaryotic cells have divergent des- 
tinations. Either they stay in the cytoplasm, or they are 
transported to organelles, to the plasma membrane, or out of 
the cell. Little is known about the determinants that direct 
a protein to its particular destination. It is generally assumed 
that such determinants reside in the protein’s specific structure. 
Recently, we have presented the El glycoprotein of coronavirus 
mouse hepatitis virus A59 (MHV-A59)! as a model intra- 
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cellular protein, the study of which might give insight into the 
targeting principles of such proteins (Rottier et al., 1984, 1985; 
Armstrong et al., 1984). 

In contrast to most other enveloped RNA viruses, corona- 
viruses bud into the endoplasmic reticulum (ER) (Holmes & 
Behnke, 1981; Niemann et al., 1982; Tooze et al., 1984). This 
particular budding site is determined by the envelope glyco- 


' Abbreviations: ER, endoplasmic reticulum; IBV, infectious bron- 
chitis virus; MHV-A59, mouse hepatitus virus strain A59; SRP, sig- 
nal-recognition particle. 
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protein El, which accumulates in internal membranes after 
its synthesis in infected cells; it can be transported out of the 
cell but only as part of virions (Dubois-Dalcq et al., 1982; 
Niemann et al., 1982; Tooze et al., 1984). El of MHV-A59 
is an O-glycosylated protein (Rottier et al., 1981; Niemann 
& Klenk, 1981; Holmes et al., 1981). Its integration into 
membranes occurs without cleavage of an N-terminal signal 
sequence (Rottier et al., 1984); nevertheless, it requires a 
signal-recognition particle (SRP) (Rottier et al., 1985). 

A better understanding of the features of E1 is not possible 
without knowledge of its disposition in the membrane. Pro- 
tease digestion of El of in vitro synthesized MHV-AS9 re- 
moved only small portions from the NH, and COOH termini 
if the translation had been carried out in the presence of dog 
pancreatic microsomes, suggesting that the protein is largely 
buried within the lipid bilayer (Rottier et al., 1984). Subse- 
quent sequencing of the El gene of MHV-AS9 revealed that 
the N-terminal part of the protein is, indeed, very hydrophobic. 
However, it remained unclear how the quite hydrophilic 
COOH terminal part of the protein is protected from digestion 
(Armstrong et al., 1984). 

Here, we study the disposition of El in the membrane by 
combining biochemical data with an analysis of its primary 
structure. We also included in our study the El sequence of 
another coronavirus, the avian infectious bronchitis virus (IBV) 
(Boursnell et al., 1984). MHV and IBV are supposed to be 
only distantly related viruses. This is underlined by the facts 
that El of IBV is glycosylated through N linkages (Stern & 
Sefton, 1982; Cavanagh, 1983) and that in IBV-infected cells 
five subgenomic mRNAs are synthesized instead of six (Stern 
& Sefton, 1984). Nevertheless, it may be expected that the 
El proteins of both viruses have essentially the same membrane 
topology. This provides a check on the outcome of our analyses 
and may add to our knowledge of the interaction of proteins 
and membranes. 


EXPERIMENTAL PROCEDURES 


The methods used in this study have been published else- 
where and are referred to at the appropriate places in the text. 


RESULTS 


Comparison of MHV-A59 and IBV E1 Primary Structures. 
Comparison of the sequences of MHV-A59 and IBV (Beau- 
dette strain) El at their gene’s nucleic acid level revealed no 
appreciable homology (data not shown). At the amino acid 
level, however, the sequence could be aligned so as to reach 
a homology of about 27% (Figure 1). Some conservation of 
the protein appears thus to have occurred during the evolution 
of these distinct viruses. This conservation becomes more 
significant when its location within the proteins is considered. 
Inspection of Figure 1 reveals that the homology is not gen- 
erally scattered over the proteins but is concentrated largely 
in their N-terminal half. In the 123 N-terminal amino acids 
of MHV, the homology is 41%, while in the remainder of the 
protein it is only 17%. In more detail, it appears that the 
homology of the N-terminal parts of MHV and IBV is clus- 
tered within two regions (residues 34-68 and residues 
101-123), which are almost 70% homologous. 

Disposition of El in the Membrane. Of the numerous 
methods to predict secondary structures of proteins, only a few, 
recent ones were designed specifically for membrane proteins. 
One simple method to make predictions about the disposition 
of various parts of membrane proteins involves the calculation 
of the free-energy cost of burying successive stretches of amino 
acids of a protein from an aqueous environment into the hy- 
drophobic interior of a membrane (von Heijne, 1981a; En- 
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FIGURE 1: Amino acid sequences of MHV protein E1 (top sequence) 
and IBV protein El] (bottom sequence) aligned according to maxiinal 
homology. Identical amino acids are indicated by asterisks. Dotted 
and hatched areas represent regions for which transmembrane and 
surface helices are predicted, respectively. Regions with high B-bend 
probability are symbolized by open boxes. 


gelman & Steitz, 1981; Steitz et al., 1982). In the case of 
bacteriorhodopsin, this method confirmed the existence of the 
seven transmembrane segments previously established by 
biophysical methods (Henderson & Unwin, 1975; Engelman 
& Zaccai, 1980). Figure 2 shows the results of such calcu- 
lations with MHV and IBV Els on a segment length of 21 
residues and with the criteria of von Heijne (1981a). Despite 
the differences in primary structure, the profiles of both 
proteins are remarkably similar. In their N-terminal halves, 
the plots resemble those of bacterial rhodopsin (Steitz et al., 
1982) and bovine rhodopsin (Nathans & Hogness, 1983). The 
obvious interpretation here is the existence of three mem- 
brane-spanning segments. Interestingly, this approach was 
in part developed to analyze signal sequences (von Heijne, 
1981a,b, 1982) but yields highly positive peptide-membrane 
free energies for the N-terminal extreme of El, rendering a 
functioning as insertion signal virtually impossible. Another 
significant result concerns the C-terminal half of El. In both 
proteins, the free-energy value of this region is also positive. 
This indicates that for this part of the polypeptide a location 
within the lipid bilayer would be unfavorable on energetic 
grounds. 

Another elegant approach to identity and characterize in 
membrane proteins the segments interacting with the lipid 


bilayer was recently presented by Eisenberg and co-workers 


(Eisenberg et al., 1984). These authors developed an algorithm 
in which the hydrophobicity of a 21-residue moving segment 
run over the entire protein is used to detect and locate, by a 
repeated process of selection and disqualification, the trans- 
membrane a-helices. Application of this algorithm to the E1 
sequences of MHV and IBV resulted in the localization in both 
proteins of three such helices. Their positions, together with 
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Table I: Transmembrane and Surface Helices in the El Protein of MHV and IBV 


MHV 
21-residue 11-residue 

segment with segment with 

highest (H)*  (H)? max (jty)° (H)4 
first helix 26-46 0.71 36-46 0.67 
second helix 52-72 0.78 53-63 0.72 
third helix 83-103 0.85 86-96 0.85 
first surface helix 102-112 0.11 


second surface helix 


IBV 


11-residue 
segment with 
max (j4)° (Hy)? (uy)? 


21-residue 
segment with 
(a)® highest (HI)? (1)? 


0,38 21-41 0.66 31-41 0.62 0.29 
0.30 52-72 0.67 60-70 0.59 = 0.25 
0.34 78-98 0.80 81-91 0.84 0.23 
0.73 102-112 -0.48 0.91 

178-188 -0.42 0.83 


* Transmembrane helices localized on the basis of maximal mean hydrophobicity (()) according to Eisenberg et al. (1984); numbers represent the 
residue numbers in the proteins, Corresponding mean hydrophobicity values. The minimal value for a transmembrane segment is 0.42, provided 
that there are two segments with a summed value of 1.10. ‘Regions with maximal mean hydrophobic moment. ?Corresponding mean hydrophobicity 


values. ‘Corresponding mean hydrophobic moment values. 


AG free energy 


residue number 


FIGURE 2: Free energy [G (kJ/mol)] of insertion of successive 21 
amino acid segments as described by von Heijne (1981a). The energy 
is plotted as a function of the first amino acid in a segment. (A) MHV 
E1; (B) IBV El. The minima may represent the start of a mem- 
brane-spanning helix. 


their mean hydrophobicities, are given in Table I. The 
positions of the helices coincide with the three minima in the 
free-energy plot of Figure 2. It is noteworthy that the first 
and the third helices are predicted at exactly equivalent 
positions in both polypeptides. Eisenberg et al. also introduced 
the hydrophobic moment plot to classify membrane-associated 
helices. With this approach, the nature of a helix can be read 
from its position in the plot as determined by the values of 
hydrophobicity and hydrophobic moment, each calculated now 
with an 11-residue window. The data thus obtained are in- 
cluded in Table I. When positioned in the hydrophobic mo- 
ment plot of Eisenberg et al., the El transmembrane helices 
appear to lie in the region of the so-called multimeric helices, 
i.e., protein segments that occur cooperatively associated within 
the lipid bilayer. Interestingly, this method also predicted the 
existence of some so-called surface helices (Table I). In the 
C-terminal half of the El molecule, a few helices were found 
with the high hydrophobic moments that are characteristic 
of close interaction with the surface of the membrane. 
Having identified the membrane-associated parts in the El 
molecule, we finally searched for additional secondary struc- 
ture for the other parts of the protein using more classical 
prediction methods. Notably, the position of @-bends as 


calculated with the rules of Chou and Fasman (1978) and of 
Cid et al. (1982) supported previous conclusions. Figure | 
shows the position of seven 6-bends predicted by one or both 
methods for the IBV molecule, two of them conspicuously 
separating each pair of transmembrane helices. Only one of 
these latter was found in MHV El. Two other @-bends are 
located in close proximity at equivalent positions in both 
proteins in the conserved region next to the first surface helix. 
Besides the transmembrane segments, only a few a-helices 
were predicted by the method of Lim (1974). These predic- 
tions in IBV El (residues 178-190) and in the corresponding 
region of MHV E] agreed with the localization of the surface 
helix found in the El protein of IBV around residue 183. 


DISCUSSION 


Three main conclusions can be drawn from the analysis of 
the coronavirus El protein sequence presented in this paper. 
(i) The protein is anchored in the membrane through three 
successive transmembrane segments. (ii) The C-terminal half 
of the polypeptide is not incorporated within the lipid bilayer. 
(iii) The coronavirus El protein has an internal signal se- 
quence. 

Though the occurrence of multiple membrane-spanning 
segments seems not to be unusual among membrane proteins, 
to our knowledge coronavirus El presents the first viral ex- 
ample of this kind. Other comparable proteins have widely 
different origins, ranging from bacterial [e.g., bacterio- 
rhodopsin (Henderson & Unwin, 1975) and Escherichia coli 
lactose carrier protein (Foster et al., 1983)] to eukaryotic 
proteins [e.g., cytochrome P-450 (Heinemann & Ozols, 1982) 
and acetylcholine receptor subunits (e.g., Noda et al., 1983)]. 
The methods we have used in our study have proven their 
usefulness in predicting directly the existence and precise 
locations of the transmembrane segments in these other pro- 
teins (Eisenberg et al., 1984). For the coronavirus El trans- 
membrane segments, the values of hydrophobicity and hy- 
drophobic moment suggest that the helices are associated 
cooperatively within the lipid bilayer. 

An intriguing question is why El behaves as an intracellular 
protein accumulating mainly in reticular membranes in the 
absence of virus maturation. It seems that the occurrence of 
multiple spanning segments is not the explanation for this 
behavior since this type of structure also exists in plasma 
membrane proteins such as erythrocyte band III protein 
(Steck, 1978), rhodopsin (Nathans & Hogness, 1983; Dratz 
& Hargrave, 1983), and the acetylcholine receptor subunits 
[e.g., Noda et al. (1983)]. 

Insertion of E1 into membranes takes place without cleavage 
of a signal sequence (Rottier et al., 1984). Nevertheless, the 
requirement of SRP (Rottier et al., 1985) implies a specific 
recognition sequence. The N-terminus of the protein has none 
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~~ polar lipids 
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FIGURE 3: Suggested disposition of the MHV E1 polypeptide across the membrane. Symbols: () a-helix (Eisenberg et al., 1984; Lim, 1974); 
(O00) B-sheet (Lim, 1974); (C) B-bend predicted by one method (Chou & Fasman, 1978); (@) 8-bend predicted by two methods (Chou 
& Fasman, 1978; Cid et al., 1982); (0) location of the four glycosylation sites. Lumenal and cytoplasmic faces of the microsomal membrane 
are topologically equivalent to outer and inner faces of the virion membrane, respectively. 


of the characteristics of such a sequence. Our previous ex- 
periments indicated that the signal for membrane insertion 
of MHV-AS59 El could be located anywhere within its first 
150 amino acids (Rottier et al., 1984, 1985). There is no 
sequence homology among published signal sequences, but 
unlike the extreme N-terminal sequences of E1, they are all 
hydrophobic and devoid of charged residues (von Heijne, 
1981a). It is therefore likely that the SRP-specific membrane 
insertion signal resides in one of the membrane-spanning hy- 
drophobic regions of E1. Possibly, the strong conservation in 
the first two membrane-spanning segments of the mammalian 
and avian viral El sequences pertains to such a function. The 
involvement of more than one functional insertion signal cannot 
be excluded, since the addition of SRP to an in vitro translation 
of El mRNA did not lead to the accumulation of a specific 
arrested peptide (Rottier et al., 1985). The occurrence of 
internal signal sequences is not without precedent. They have 
been postulated for several other membrane proteins (Braell 
& Lodish, 1982; Anderson et al., 1983; Chin et al., 1984; 
Holland et al., 1984) as well as for a secreted protein (Lin- 
gappa et al., 1979). 

Combining the results of our theoretical predictions with 
the biochemical data on the MHV-AS59 E1 protein assembled 
in microsomal membranes (Rottier et al., 1984), we arrive at 
the model shown in Figure 3. On the lumenal side of the 
microsomal membrane, which corresponds to the outside of 
the virion, an N-terminal region of about 2.5K is exposed and 
thus susceptible to proteolysis. The MHV sequence contains 
here potential O-glycosylation sites clustered at positions 2-5, 
which in the mature protein probably become positions 1-4 
by the removal of the terminal methionine [see Housman et 
al. (1970)]. O-Linked sugars often appear in clusters near 
one end of the molecule [see Sharon & Lis (1981)]. In gly- 
cophorin, for example, all of the 15 O-glycosylated hydroxy 
amino acids are found in the first 50 N-terminal residues, 
particularly in the extreme terminal portion where each of 
residues 2-4 and 10-15 carries an oligosaccharide unit (Tomita 
et al., 1978). MHV El enters the lipid membrane around 
residue 26 as judged from the size of the protease-sensitive 
fragment and the primary structure at this point. Then, the 
polypeptide chain traverses the membrane three times. It 
emerges from the cytoplasmic face of the lipid bilayer with 
the surface helix around residue 109; this probably represents 
the domain that is slightly accessible to proteases (Rottier et 
al., 1984). Then, a remarkably long stretch of amino acids 
follows for which, apart from a few B-bends, no secondary 
structure is predicted. This part of the protein is again ex- 


tremely protease resistant if synthesized in the presence of 
microsomal membranes (Rottier et al., 1984). Therefore, we 
postulate it to be embedded in the surface of the membrane, 
together with the second surface a-helix (residues 182-192). 
An alternative possibility is an interaction with the trans- 
membrane a-helices. Finally, the polypeptide becomes exposed 
again, the 6-bend at positions 214-218 probably marking the 
start of the protease-sensitive C-terminus. 

The intracellular budding of coronaviruses is supposed to 
involve a highly specific recognition between nucleocapsids in 
the cytoplasm and El molecules accumulated in the endo- 
plasmic reticular membranes. The C-terminal part of the El 
proteins is located at the cytoplasmic side of the ER and is 
thus available for interaction with the nucleocapsid protein 
and the genome. The presence of almost all basic amino acids 
in this very part of the protein supports this view and may 
explain the general affinity of El for RNA (Sturman et al., 
1980). 
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ABSTRACT: Our attempts to develop adrenocorticotropic hormone (ACTH) analogues that can be employed 
for ACTH receptor identification and isclation began with the synthesis of ACTH fragments containing 
N‘-(dethiobiotinyl)lysine (dethiobiocytin) amide in position 25 to be used for affinity chromatographic 
purification of hormone~receptor complexes on Sepharose-immobilized avidin resins. Because labeling ACTH 
or ACTH fragments by conventional iodination techniques destroys biological activity due to oxidation of 
Met‘ and incorporation of iodine into Tyr’, we have prepared [Phe?,Nle*] ACTH)_»4, [Phe?,Nle*, biocy- 
tin?>] ACTH)_.; amide, and [Phe?,Nle*,dethiobiocytin*>] ACTH,_; amide by conventional synthetic techniques. 
The HPLC profiles and amino acid analyses of the final products indicate that the materials are of a high 
degree of purity. The amount of tertiary butylation of the Trp residue in the peptides was assessed by NMR 
and was found to be less than 0.5%. All three peptides are equipotent with the standard ACTH)_,, as concerns 
their ability to stimulate steroidogenesis and cAMP formation in bovine adrenal cortical cells. Iodination 
of [Phe?,Nle*] ACTH _,4, with iodogen as the oxidizing agent, has been accomplished without any detectable 
loss of biological activity. The mono- and diiodo derivatives of [Phe?,Nle*]ACTH,_»4 have been prepared, 
separated by HPLC, and assayed for biological activity. Both peptides have the full capacity to stimulate 
steroidogenesis and cAMP production in bovine adrenal cortical cells. 


I. is the ultimate aim of this investigation to gain information 
regarding the chemical nature of ACTH! receptors. Con- 
ventional approaches to this problem have thus far been un- 
successful; in particular, the receptor appears to lose its affinity 
for ACTH upon solubilization. We are exploring a combi- 
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RR 00292. 


nation of affinity labeling and affinity chromatographic 
techniques based on the avidin-biotin interaction to identify 


1 Abbreviations: ACTH, adrenocorticotropic hormone; Bet, biocytin; 
Boc, tert-butoxycarbonyl; CCD, countercurrent distribution; DCC, N,- 
N’-dicyclohexylcarbodiimide; DIPEA, N-ethyldiisopropylamine; DMF, 
dimethylformamide; DTBct, dethiobiocytin; HPLC, high-pressure liquid 
chromatography; OBu', tert-butyl ester; OSu, N-hydroxysuccinimide 
ester; TCA, trichloroacetic acid; TFA, trifluoroacetic acid; TLC, thin- 
layer chromatography; Z, benzyloxycarbonyl. 
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